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Abstract. 

In these notes, we describe the recent progress in understanding the zero sets of 
two remarkable Gaussian random functions: the Gaussian entire function with invariant 
distribution of zeroes with respect to isometrics of the complex plane, and Gaussian 
spherical harmonics on the two-dimensional sphere. 

These notes consist of two almost independent parts. In both of them, we talk 
about zeroes of special Gaussian random functions. To understand them, we had 
to combine various tools from complex and real analysis with rudimentary proba- 
bilistic methods. We think that the results and techniques presented here can serve 
as guidelines in other problems of similar nature arising in analysis, mathematical 
physics, and probability theory. 

The function F that we consider in the first part is a random analytic function 
of one complex variable. In this case, one can recover the zeroes of F by applying 
the Laplacian to log|i^|. This paves the way for using complex analysis tools, 
and for this reason, the problems that we discuss in the first part are pretty well 
imderstood by now, though some intriguing questions still remain open. 

In the second part, we deal with topological properties of the zero sets of 
random (real- valued) functions of several real variables. This is an area with wealth 
of interesting and difficult questions and with very few advances. In essence, in 
this part, the reader will find a discussion of one recent theorem on the number 
of connected components of zero sets of Gaussian spherical harmonics along with 
various open questions. 



Part I. Random complex zeroes 



The study of zeroes of random polynomials and random analytic functions has 
a long history. It started with the pioneering works of Kac, Littlcwood, Offord, 
Rice, and Wiener, and was later continued by Hammersley, Kahane, Maslova, and 
many others. The subject was revived in the 1990's by several groups of researchers 
(Bogomolny-Bohigas-Leboeuf, Shub-Smale, Edelman-Kostlan, Ibragimov-Zeitouni, 
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Hannay, Bleher-Shiffman-Zelditch, Nonnenmacher-Voros) who came from very dif- 
ferent areas and established new links to mathematical physics, probability theory, 
and complex geometry. Some of these results were surveyed in the lectures by 
Zelditch [SJ and Sodin [44]; see also an introductory article [33] and the recent 
book by Hough, Krishnapur, Peres, and Virag [21] . 

In particular, Kostlan, Bogomolny-Bohigas-Leboeuf, Shub-Smalc, and Hannay 
introduced a remarkable construction of random Gaussian entire functions with 
translation invariant distribution of their zeroes. Let 



ri>0 

where ^„ are independent standard complex Gaussian random coefficients (i.e., 
the density of Cfe with respect to the Lcbesgue measure in C is ^e~l'»l ). The 
distribution of the random function F is invariant with respect to rotations around 
the origin, but it is not translation invariant, for instance, because f |i^(z)p = 
e'^l (here and below, E means the mathematical expectation). However, the 
distribution of the zero set Z = F~^{0} is translation invariant. One of the 
ways to sec this is to check that the Gaussian random function 

has the same covariance function as F: 



£{F{z)Fiw)} ^ £{Fxiz)Fx{w)} = e/^ 

which is nothing else but the reproducing kernel in the classical Fock-Bargmann 
space of entire functions. This coincidence is not accidental [33]. Moreover, due to 
remarkable Calabi's rigidity [211 Section 2.5], this is the only translation invariant 
zero set of a Gaussian entire function up to scaling. We call the function F the 
Gaussian Entire Function (G.E.F., for short). 

It is worth mentioning that there exist similar constructions for other domains 
with transitive groups of isometrics (the hyperbolic plane, the Riemann sphere, 
the cylinder and the torus). 



1. Linear statistics 

One of the most traditional ways to study asymptotic properties of a random point 
process Z in the plane is to take a test-function h, and to look at the asymptotic 
behaviour of the linear statistics 

as r — >■ oo. We put n(r, h) = nz{r, h). An easy computation shows that 

r2 f 

£n{r, h) = — / h . 
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If h = 1e is the indicator function of a set E, then n{r, 1e) = n{rE) is the number 
of zeroes in the set rE. 

A usual "triad" in the study of linear statistics is 

VARIANCE, ASYMPTOTIC NORMALITY, LARGE FLUCTUATIONS 

First, we'll discuss the variance, which is the easiest part of the triad. 
1.1. The variance. 

Theorem 1.1 (The variance). For every non- zero function h S {L^ D L'^)(M.'^) and 
every R > 0, 



where 



and 



Yai-nir,h) = r^ / |/i(A)|^Af(r"^A) dm(A) 



a>l 



h{X) = / /i(a;)e-^'^'<^'^>dm(a;) 

is the Fourier transform of h. 

This theorem was proven in |34j . The asymptotic of the variance had been 
known for two special cases since the work by Forrester and Honner [IT : if /i £ Cq 
(i.e., /i is a C^-function with compact support), then 

Varn(r,/i)==^^^±^||A/i||i., r ^ oo , (1.1) 
while for bounded domains G with piecewise smooth boundary, 

Varn(rG) = "^"^^ r L{dG) , r ^ oo . (1.2) 

Here, C( ' ) is Riemann's zeta- function. 

A less precise form of Theorem 11.11 might be more illustrative: 

Varn(r,/i) ~ /" |/J(A)nA|'' dm(A) + /" \h{\)\^ dm{\) , (1.3) 

J\\\<r "'|A|>r 

where the notation A c^i B means that the quotient _B/A is bounded from below 
and from above by positive numerical constants. The right-hand side of ()1.3p 
interpolates Il/i||i2(„) and \\/^h\\\.2(,^,,y 
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1.2. Digression: "superhomogeneous" point processes. 

By (|1.2I) , the random zero process Z belongs to the family of so called superho- 
mogeneous translation invariant point processes with fluctuations of the number 
of points in large domains proportional to the length of the boundary rather than 
to the area, as it would be, say, for the Poisson process. 
A "toy model" for such processes is the point process 



(1.4) 



obtained by perturbing the lattice -^/tt 1? by independent standard complex Gaus- 
sian random variables Q^. The normalization by ^pH is not essential here, it is 
introduced to have asymptotically the same mean number of points in large areas 
as our process Z has. The choice of the square lattice is not essential either. 

Curiously, the same kernel e^"" that occurs in the definition of random complex 
zeroes generates by a very different construction another interesting superhomoge- 
neous point process Q ^ namely, the determinantal process whose A:-point functions 
can be expresed in terms of the determinants formed by this kernel. 



li<ij<fc 



This process arises as the large N limit of eigenvalues of Ginibre ensemble of 
N X N matrices with independent standard complex Gaussian entries, and we will 
call it the limiting Ginibre process. It is known that the Ginibre point process is 



Figure 1. Samples of the Poisson process (figure by B. Virag), limiting Ginibre process, 
and zeroes of a GEF (figures by M. Krishnapur). Some properties of the last two processes 
are quite different, though the eye does not easily distinguish between them. 

a special, explicitly solvable case of a one-component plasma of charged particles 
of one sign confined by a uniform background of the opposite sign. Though the 
one-component plasma has been studied by physicists for a long time, it seems that 
most of rigorous results still pertain only to the very special case of the Ginibre 
ensemble. 

Resemblances and differences between the processes Z, S, and Q were discussed 
both in the physical and the mathematical literature. For instance, the behaviour 
of smooth linear statistics for these three processes is quite different. In particular, 
decay of the variance of smooth linear statistics (jl.ip distinguishes the zero process 
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Z from the processes Q and 5, since for the latter two processes, the variance of 
smooth linear statistics tends to the positive limit proportional to II V/i|||2(„)- 

1.3. Asymptotic normality of fluctuations. 

1.3.1. Normal fluctuations. We say that the linear statistics n(r, K) have asymp- 
totically normal fluctuations if the normalized linear statistics 

n(r, h) — £n{r, h) 
-^Varn(r, h) 

converge in distribution to the standard (real) Gaussian random variable as r— >oo. 

Let Cg , a > 0, be the class of compactly supported C"-functions, by Cg we 
denote the class of bounded compactly supported measurable functions. 

Theorem 1.2 (Asymptotic normality). Suppose that h G with some a > 0, 
and that for some £ > and for every sufficiently big r, we have 

Var n(r, /i) > r~2"+^ (1.5) 

Then the linear statistics n{r, h) have asymptotically normal fluctuations. 

Note that by (|1.3p . we always have Var n(r,/i) > c{h)r~''^ with positive c{h) 
independent of r. Hence, for a > 1, condition (jl.Sp holds automatically, and we 
obtain the following 

Corollary 1.1. Suppose that h e Cg with a > 1. Then the linear statistics n{r, h) 
have asymptotically normal fluctuations. 

Using estimate (jl.Sp . one can show that for any bounded measurable set E of 
positive area, Yav n{rE) > r, cf. (|1.2[) . Hence, 

Corollary 1.2. Let E be a bounded measurable set of positive area. Then the 
number of random complex zeroes n(rE) on the set rE has asymptotically normal 
fluctuations. 

1.3.2. Abnormal fluctuations of linear statistics. Do there exist Cg -func- 
tions h with abnormal asymptotic behaviour of linear statistics n(r, ft,)? The answer 
is "yes", and the simplest example is provided by the function ha — |a;|"'0(x), 
where ijj is a smooth cut-off that equals 1 in a neighborhood of the origin. Clearly, 
ha € Cg and it is not difficult to show that Varn(r, /i^) ~ r^^". This shows 
that Theorem 11.21 is sharp on a rough power scale. The reason for the loss of 
asymptotic normality is that only a small neighbourhood of the origin where ha 
loses its smoothness contributes to the variance of n{r,ha). This neighbourhood 
contains a bounded number of zeroes of F, which is not consistent with the idea 
of normal fluctuations. 
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1.3.3. Comments and questions. Theorem 11.21 was preceded by a result of 
Sodin and Tsirelson |451 Part I]. Using the moment method and the diagrams, 
they showed that the fluctuations are asymptotically normal provided that h £ Cq . 
Their technique works in several other cases, for instance, when h ~ 1q is the 
indicator function of a bounded domain G with a piecewise smooth boundary. 
However, it seems very difficult to adapt it for proving Theorem 11.21 in its full 
generality. 

In the case a > 0, the proof of Theorem II .21 uses a classical idea of S.Bernstein 
to approximate the random variable n{R, h) by a sum of a large number of indepen- 
dent random variables with negligible error. Such approximation becomes possible 
only after we separate the high and the low frequencies in h. In this approach, 
independence appears as a consequence of the almost independence of the values 
of the G.E.F. at large distances, which we'll discuss below in Section [XT] We do 
not know whether asymptotic normality holds for all functions h G Cq, or whether 
the condition r^" Varn(r, /i) — > oo is already sufficient for asymptotic normality of 
linear statistics associated with a Cq -function. Also, we believe that the assertion 
of Theorem 11.21 can be extended to functions h G C" n Lq with — 1 < a < but 
our current techniques seem insufficient to handle this case properly. 

In the case a = 0, the proof uses a different idea which comes from statistical 
mechanics. First, we show that fc-point functions of the zero process Z are cluster- 
ing, see Section for the precise statement. Then, using clustering, we estimate 
the cumulants of the random variable n{r, h). 

It is interesting to juxtapose Theorem 11.21 with what is known for the limiting 
Ginibre process Q described above. For bounded compactly supported functions h, 
a counterpart of Theorem 1 1.2 1 is a theorem of Soshnikov. In [47], he proved among 
other things that for arbitrary determinantal point processes, the fluctuations of 
linear statistics associated with a compactly supported bounded positive function 
are normal if the variance grows at least as a positive power of expectation as the 
intensity tends to infinity. A counterpart of the limiting case a = 2 in Theorem ll.2l 
(that is, of the result from [JSJ Part I]) was recently found by Rider and Virag 
in [42]. They proved that the fluctuations for linear statistics of process Q are 
normal when the test function h belongs to the Sobolev space Wi. It is not clear 
whether there is any meaningful statement interpolating between the theorems of 
Soshnikov and Rider and Virag. It can happen that our Theorem 11.21 simplv has 
no interesting counterpart for the process Q. It is also worth mentioning that the 
proofs in the determinantal case are quite different from ours. They are based on 
peculiar combinatorial identities for the cumulants of linear statistics that are a 
special feature of determinantal point processes. 

1.4. Probability of large fluctuations. 

Now, we turn to the probability of exponentially rare events that, for some r 3> 1, 
|n(r, /i) — £n{r,h)\ is much bigger than •\/Var(n(r, h). Mostly, we consider the 
case when h is the indicator function of the unit disk B; i.e., we deal with the 
number n(r) of random zero points in the disk of large radius r centered at the 
origin. Recall that £n{r) = and £{{n{r) — r^)^} ~ cr for r — > oo (with some 
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c > 0). Hence, given a > ^, we need to find the order of decay of the probability 
V{\n{r)-r'^\ > r"}. 

1.4.1. Naive heuristics. The aforementioned similarity between the zero pro- 
cess Z and independent complex Gaussian perturbations S of the lattice \pKl? 
helps to guess the correct answer. 

We fix the parameter ;y > 0, and consider the random point set 5,/ = {w + 
Ci^jlwez^i where are independent, identical, radially distributed random variables 
with the tails 'P{|Cw| > decaying as exp(— t"^) as t — )■ oo. Set 

n,(r) x/^Z^: |cj + Cc.| < r}. 

Then, for every a > ^ and every e > 0, 

exp[-r'^("''^)+"] < V{\n^{r) - r'^\ > r"} < exp[-r'^("''')-^] , 

provided that r is sufficiently big. Here 

{2a- 1, 5 < a < 1; 

(i^ + l)a - I/, 1 < a < 2; 
(ii/+l)a, a>2. 

Actually, one can find much sharper estimates for V{\n,^{r) — > r"}. 

This suggests that the probability 'P{\n{r) — r'^\ > r"} we are after should 
decay as exp[— r'^^"^] with 

{2a -1, i < a < 1; 
3a -2, 1 < a < 2; 
2a, a>2. 

1.4.2. Jancovici-Lebowitz-Manificat Law. Unfortunately, we do not know 
how to represent random complex zeroes as independent, or weakly correlated, 
Gaussian perturbations of the lattice points, so we cannot use the heuristics given 
above. Nevertheless, we can prove 

Theorem 1.3 (JLM Law for random complex zeroes). For every a > ^ and every 
e > 0, 

exp[-r'^(")+"] < r{\n{r) - r^] > r"} < exphr'^^")""] 
for all sufficiently large r > ro(a,e) with the same ip{a) as above. 

In [TB], Jancovici, Lebowitz and Manificat showed that this law holds for the 
one-component plasma. Their derivation was not a rigorous one, except for the case 
of the limiting Ginibre process Q. It would be desirable to have a clear explanation 
why the same Jancovici-Lebowitz-Manificat law holds for the random processes Z, 
S, and G in the range a > 1. 
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1.4.3. Comments and questions. The function ip from the exponent in the 
Jancovici-Lebowitz-Manificat Law loses smoothness at three points. Accordingly, 
there are three different regimes (i<Q;<l,l<Q!<2, and a > 2). The point 
a = i corresponds to the asymptotic normality of n{r), and deviations in the range 
i < a < 1 are called moderate. In this range, the deviation |n(r) — is small 
compared to the length of the circumference {|z| = r}. In this case, the theorem 
was proven by Nazarov, Sodin, and Volberg |37| . The point a = 1 corresponds 
to the classical large deviations principle. In the range 1 < a < 2, the deviation 
is already big compared to the length of the boundary circumference, but is still 
small compared to the area of the disk {|2:| < r}. In this case, the lower bound for 
V {\n{r) — r^l > r"} is due to Krishnapur |22j . while the upper bound was proven 
in [33. 

The case a — 2 contains an estimate for the "hole probability" V {7i{r) ~ 0}. 
In this case, the theorem was proved by Sodin and Tsirelson 0^1 Part III]. A very 
sharp estimate of the hole probability 

log-pjnM = 0} = --^r"* + O(r^), r 00 , 

was recently obtained by Nishry [3S]; in [32] be extended this asymptotics to a 
rather wide class of entire functions represented by Gaussian Taylor series. There 
are two interesting questions pertaining to the hole probability. We have no idea 
how to find the asymptotics of the expected number of random complex zeroes in 
the disk i?D, R > r, conditioned on the hole {n(r) = 0}. We also do not know 
how to extend Nishry's result from the unit disk to other bounded domains G. It 
seems plausible that for a large class of bounded domains G, 

logV {n{rG) 0} = -{k{G) + o{l))r'^ , r ^ 00, 

with k{G) > 0. If this is true, how does k,{G) depend on G? 

The range a > 2 in the Jancovici-Lebowitz-Manificat Law is the "overcrowding" 
regime. In [35], Krishnapur proved that for a > 2, 

\ogV {n{r) > r"} = - (ia - 1 + o(l))r2" log r , r ^ 00 . 

The bounds in Theorem 11.31 are not too tight. As we've already mentioned, in 
some cases, much better bounds are known. It would be good to improve precision 
of Theorem 11.31 in other cases. For instance, to show that for a < 2 and for (5 > 
there exists the limit 

logT' {|n(r) - r^l > ,5r"} 
lim J—. 

r-^oo r'^W 

and to find its value. 

1.4.4. Moderate deviations for smooth linear statistics. Here is a recent 
result of Tsirelson [52] : 
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Theorem 1.4. Let heC^. Then 

\ogP{rn{r,h) > ta\\Ah\\L2} = (1 + o(l)) log( ^ / e-^'/^dx) 

and 

\ogV{rn{r,h) <-ta\\/^h\\L2] = {l + o{l))\og[-= / e-^'/^dx), 

^ V 27r Jt ^ 

as r^oo,t>0, and t^-^ 0. Here, cr^ = ^ (cf. (HI]) ). 

The proof of this theorem is quite intricate. Note that it gives bounds that 
are much sharper than the ones in Theorem 11.31 In the case t = const, Theo- 
rem [L4] gives another proof of the asymptotic normality of smooth linear statistics 
of random complex zeroes. 

It is not clear whether the assumption '' — )■ can be replaced by a more 
natural one ^ 0. To the best of our knowledge, until now, there have been 
no results about large or huge deviations for smooth linear statistics of random 
complex zeroes when t is comparable or much larger than r. 



2. Uniformity of spreading of random complex ze- 
roes over the plane 

Let Z he & point process in with the distribution invariant with respect to the 
isometrics of W^. A natural way to check how evenly the process Z is spread over 
is to find out how far the counting measure 

nz = '^5a 

of the set Z {5a is the unit mass at a) is from the Lebesgue measure md in M'^. We 
describe a convenient way to measure the distance between nz and m^. 

Suppose that the mean number of points of Z per unit volume equals 1. We 
want to partition the whole space W^^ except possibly a subset of zero Lebesgue 
measure, into disjoint sets B{a) of Lebesgue measure 1 indexed by sites a Cz Z in 
such a way that each set B{a) is located not too far from the corresponding site 
a € Z. In other words, we are looking for a measurable map T-.W^^Z such that 
for each a e Z, we have md{T~^ {a}) = 1. Wc also want the distances \Tx — x\ 
to be not too large. The map T is called the transportation (a.k.a. "matching" , 
"allocation", "marriage", etc.) of the Lebesgue measure to the set Z. 

Alternatively, we can fix a lattice F C M"^ with cells of unit volume, and look 
for a bijection 0: F — Z for which the distances |97 — 7I, 7 G F, are not too 
large. Since for each two lattices Fi, F2 with cells of the same volume, there is a 
bijection 6': Fi — > F2 with sup {\9'-f — 71 : 76 Fi} < 00, the choice of the lattice is 
not important, so we can take F = Z'*. 
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Since we deal with random discrete sets Z , the corresponding transportation 
maps T (or the bijections 9) will be random maps. In interesting cases (including 
the random complex zeroes -Z), almost surely, the transportation distances |Ta; — a;| 
are unbounded, so we are interested in the rate of decay of the probabiUty tails 
V{\Tx-x\ > R} as CO. 

Here we present two approaches to this problem developed in [45, Part II] and 
in |36j . Though we discuss only the random complex zeroes Z, we believe that both 
approaches should work for other natural translation invariant point processes. At 
last, we recall that the random complex zero process Z has intensity tt, not 1. For 
this reason, we will look for a transportation of the measure 7rm2 to Z, and for a 
bijection between the lattice y/nl? and Z. 

2.1. Random complex zeroes as randomly perturbed lattice 
points. 

Theorem 2.1 (Existence of well-localized bijection). There exists a translation 
invariant random function ^ : — ^ C such that 

(a) the random set {7+^(7) : 7€ ^/^T'Z'^} is equidistributed with the random complex 
zeroes Z; 

(b) 'P{\({0)\ > R} < exp {-cR^/logR) for some c> and every R>2. 

The theorem is almost optimal since the probability that the disk of radius 
A > 1 is free of random complex zeroes is not less than exp (— CA^). It seems that 
the question about the existence of a matching between the lattice and Z with 
tails decaying as exp (— cA'*) remains open as well as the same question for the 
Gaussian perturbations S of the lattice points and for the limiting Ginibre process 

g. 

It would be interesting to find a version of Theorem 12 . II with weakly correlated 
perturbations £,kj at large distances. This could shed some light to the reasons 
hidden behind the Jancovici-Lebowitz-Manificat Law. 

2.1.1. Uniformly spread sequences in M**. The proof of Theorem l2.1] is based 
on a deterministic idea which might be useful in study of the uniformity of spread- 
ing of sequences and measures. We need to establish the bijection between the sets 
Z and \pK'I? with controlled tails of the distances l^^fc,;!- First we look at a simpler 
situation when \ are uniformly bounded. It is too much to expect from a typical 
zero set, but let us try anyway. We say that the set Z S is r-uniformly spread 
over M'' (with density 1) if there exists a bijection between Z and a lattice with the 
unit volume of the cell such that the distances between Z and the corresponding 
lattice points do not exceed r. If such a bijection exists then clearly 

n{U) < v{U+r) and v{U) < n{U+r) (2.1) 

for every ?7 C M*^; here U+r stands for the r-neighbourhood of ?7, n is the counting 
measure of the set Z , and v is the counting measure of the lattice. In fact, (|2.ip is 
not only necessary but also sufficient^ which is basically a well-known locally finite 
marriage lemma due to M. Hall and R. Rado. When verifying condition \2.1\ . we 
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can replace v by the Lebesgue measure at the expense of adding a constant 
to r . 

Now, given a locally finite measure [i on M**, we define Di(^) as the infimum of 
r g (0, oo) such that 

< md(X+,.) and md{X) < fJ.{X+r) 

for every bounded Borel set X cM.'^. The range of Di is [0, +oo] with the both 
ends included. The following theorem gives a useful upper bound for Di(/i) in 
terms of the potential u: 

Theorem 2.2 (Upper bound for the transportation distance). Let u be a locally 
integrable function in such that Au = /i — in the sense of distributions. 
Then 

Di(/i) < Constd • inf {r + y^\\u * Xr||oo I • 

r>0 

Here, Xr 'is the indicator function of the ball of radius r centered at the origin 
normalized by the condition \\Xr\\L^ = 1; and * denotes the convolution. 

Now, we explain how Theorem l2.1l is deduced from Theorem l2.2l After smooth- 
ing, the random potential U{z) — log — is locally uniformly bounded. 
Still, a.s. it remains unbounded in C, so we cannot apply Theorem 12.21 directlv. 
The idea is to introduce on C a random metric p that depends on a G.E.F. F. 
The metric p is small where the random potential U is large. Then we apply a 
counterpart of Theorem 12.21 with the distances measured in the metric p, instead 
of the Euclidean one. 

2.1.2. Comments. Theorems l2.1l and l2.2l are taken from Sodin and Tsirelson [JSJ 
Part II] (cf. [i6\ ) . Ill that paper, the authors proved a weaker subgaussian estimate 
for the tails, however, after a minor modification of the proof given therein, one 
gets the result formulated here. Note that the method developed in Sodin and 
Tsirelson [45l Part II] needs only the existence of a stationary random vector field 
V in M.'^ with divu = p — Cdmd- The tail estimate depends on the rate of decay of 
the tails of the field v or of the tails of the potential u such that v = Vu (if such 
a u exists). 

In the last 20 years, the concept of uniformly spread discrete subsets of has 
appeared in very different settings. Laczkovich used uniformly spread sets in in 
his celebrated solution of the Tarski's circle squaring problem [23] (see also [24j). 
There are various probabilistic counterparts of this notion. For instance, Ajtai, 
Komlos and Tusnady [I], Leighton and Shor and Talagrand [5T] studied a 
finite counterpart of this, namely, a high probability matching of a system of N"^ 
independent random points in the square [0, NY C with the grid n [0, N)"^. 

2.2. Gradient transportation. 

Unfortunately, the proof of Theorem 12.11 is a pure existence one. It gives us no 
idea about what the (almost) optimal transportation of the Lebesgue measure to 
the zero process Z looks like. Now, we discuss another approach, namely, the 
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transportation by the gradient flow of a random potential. The main advantage 
of this approach is that it yields a quite natural and explicit construction for the 
desired transportation. 

2.2.1. Basins of zeroes. Let U{z) = log |i^(2)| — be the random potential 
corresponding to the G.E.F. F. It is easy to check that the distribution of U is 
invariant with respect to the isometrics of the plane. We shall call any integral 
curve of the differential equation 

a gradient curve of the potential U. We orient the gradient curves in the direction 
of decrease of U (this is the reason for our choice of the minus sign in the differential 
equation above). If z ^ Z, and \7U{z) ^ 0, by we denote the (unique) gradient 
curve that passes through the point z. 

Definition 2.1. Let a be a zero of the G.E.F. F. The basin of a is the set 

B{a) = {z e C \ Z : VC/(z) 7^ 0, and terminates at a} . 

Clearly, each basin B{a) is a connected open set, and B{a') D B{a") = if a' 
and a" are two different zeroes of F. Remarkably, all bounded basins have the same 
area tt. Indeed, ^ = on dB{a) and therefore, applying the Green formula and 
recalling that the distributional Laplacian of U equals AJ7 = "^T^^aeZp ^a- ^ 
one gets 

1 - "IlE^A = — I I AU(z) dm(z) = — / ^(z) \dz\ = ; 



^// AU{z)dm{z) = ^ f ^{z)\dz\ 
JJB(a) 27r Jq^u) on 



J(a) ^" JdB{a) 

i.e., mB{a) = tt. The picture below helps to visualize what's going on. 
2.2.2. Results. 

Theorem 2.3 (Random partition). Almost surely, each basin is bounded by finitely 
many smooth gradient curves ( and, thereby, has area tt ), and 



c = y i3(c 



ae2 

up to a set of measure ( more precisely, up to countably many smooth boundary 
curves). 

The tails of this random partition have three characteristic exponents 1, |, 
and 4. The probability that the diameter of a particular basin is greater than R 
is exponentially small in R. Curiously enough, the probability that a given point 
z lies at a distance larger than R from the zero of F it is attracted to decays 
much faster: as e~^**^ . This is related to long thin tentacles seen on the picture 
around some basins. They increase the typical diameter of the basins though the 
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Figure 2. Random partition of the plane into domains of equal area generated by the 
gradient flow of the random potential U (figure by M. Krishnapur). The lines are gradient 
curves of U , the black dots are random zeroes. Many basins meet at the same local 
maximum, so that two of them meet tangentially, while the others approach it cuspidally 
forming long, thin tentacles. 

probability that a given point z lies in such a tentacle is very small. At last, given 
e > 0, the probability that it is impossible to throw away e% of the area of the 
basin so that the diameter of the remaining part is less than R decays as e"^ . 
All three exponents arc optimal. The proofs of these results rely on the following 
long gradient curve theorem. 

Theorem 2.4 (Long gradient curve). Let R> 1. Let Q{R) be the square centered 
at the origin with side length R. The probability of the event that there exists a 
gradient curve joining dQ{R) with dQ{2R) does not exceed Ce"'^^^^"^^^ ^ . 

The proof of this theorem is, unfortunately, rather long and complicated. It 
might be helpful for the reader to look at the first version of [351 posted in the 
arxiv where the authors gave a more transparent proof of a weaker upper bound 
(j^-cR^\ogR ^YiQ long gradient curve theorem. 

2.2.3. Comments and questions. Gradient transportation was introduced by 
Sodin and Tsirclson [55] Part II] and studied by Nazarov, Sodin, Volbcrg in [55] . 

There arc several questions related to the statistics of our random partition of 
the plane. It is not difficult to show that, almost surely, any given point z € C 
belongs to some basin. We denote that basin by B^, and the corresponding sink 
by az- We say that two basins are neighbours if they have a common gradient 
curve on the boundary. By Nz we denote the number of basins B neighbouring 
the basin Bz- Clearly, Nz equals the number of saddle points of the potential U 
connected with the sink by gradient curves. Heuristically, since almost surely 
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each saddle point is connected with two sinks, 

2 mean number of saddle points per unit area 
^ mean number of zeroes per unit area 

Douglas, Shiffman and Zclditch proved in [TS] that the mean number of saddle 
points of U per unit area is (They proved this for another closely related 
"elliptic model" of Gaussian polynomials. It seems that their proof also works 
for G.E.F.'s) This suggests that S-N^ = |. Another characteristic of the random 
partition is the number of basins that meet at the same local maximum. Taking 
into account the result from [15], we expect that its average equals 8. 

The next question concerns the topology of our random partition of the plane. 
By the skeleton of the gradient flow we mean the connected planar graph with 
vertices at local maxima of U and edges corresponding to the boundary curves 
of the basins. The graph may have multiple edges and loops. We do not know 
whether there are any non-trivial topological restrictions on finite parts of the 
skeleton that hold almost surely. 

In [TT] Chatterjee, Peled, Peres, Romik applied the ideas from [3^ to study 
the gradient transportation of the Lebcsguc measure to the Poisson point process 
in R'' with d > 3 (they called it 'gravitational allocation'). Their work required a 
delicate and thorough analysis of the behaviour of the Newtonian potential of the 
Poisson point process. It's worth mentioning that a very different construction of 
the stable marriage between the Lebcsguc measure and the Poisson process in 
R"^ with d > 2 was developed by Hoffman, Holroyd and Peres in [5D]. The case 
d = 2 is especially interesting: see the recent work by Holroyd, Pemantle, Peres, 
Schramm [l9| . 



3. Almost independence and correlations 

3.1. Almost independence at large distances. 

The covariance function of the normalized Gaussian process F*{z) = F(z)e~^l^l 
equals 

which decays very fast as jz — w| grows. This suggests an idea that the zeroes 
of G.E.F.'s must be "almost independent" on large distances. Still the precise 
formulation of this independence property is not obvious: due to analyticity of F, 
if we know the process F* in a neighbourhood of some point, we know it everywhere 
on the plane. 

It is not difficult to show that two standard complex Gaussian random variables 
with small covariance can be represented as small perturbation of two independent 
standard complex Gaussian random variables. Developing this idea, we show that 
if {Kj} is a collection of well-separated compact sets, then the restrictions i^* I ^ of 

normalized process F* can be simultaneously approximated by restrictions F* | 
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of normalized independent realizations of G.E.F.'s Fj with high precision and the 
probability very close to 1. This is a very useful principle that lies in the core of the 
proofs of most of the results described above. Here is the precise statement [33] : 

Theorem 3.1 (Almost independence). Let F be a G.E.F.. There exists a numer- 
ical constant A > 1 with the following property. Given a family of compact sets 
Kj in C with diameters d{Kj), let Xj > ma,yi{d{Kj),2} . Suppose that Ay^log Xj- 
neighbourhoods of the sets Kj are pairwise disjoint. Then 

F*:^F*+G* onKj, 

where Fj are independent G.E.F.'s and for every j , we have 

P{max|G*|>A-i}<e-^^. 

Less general versions of this result were proven in [37] . 

The proof of Theorem 13.11 goes as follows. First, for each compact set Kj, we 
choose a sufficiently dense net Zj and consider the bunch Nj = {vz - z & Zj^ of 
unit vectors Vz = F* (z) in the Hilbert space of complex Gaussian random variables. 
Since the compact sets Kj arc well-separated, the bunches Nj arc almost orthog- 
onal to each other. Then wc slightly perturb the vectors Vz without changing the 
angles between the vectors within each bunch Nj , making the bunches orthogonal 
to each other. More accurately, we construct new bunches Nj = {vz : z & Zj} so 
that for z Cz Zj, C. Cz Zk, 



Vz,V(;) 



{vz,Vi^) forj = fc, 
for j ^ k 



with good control of the errors \\vz — Vz\\. Then we extend the Gaussian bunches 
{vzC^^^^^ : z G Zj] to independent G.E.F.'s Fj. The difference Gj = F — Fj is a 
random entire function that is small on the net Zj with probability very close to 
one. At the last step of the proof, using some simple complex analysis, we show 
that G* is small everywhere on Kj. 

3.2. Uniform estimates of fc-point functions. Clustering. 

There is yet another way (originated in statistical mechanics) to describe point 
processes by the properties of their fc-point correlation functions. Recall that the 
fc-point function p = pk oi the zero process -Z is a symmetric function on C*^ defined 
outside of the diagonal subset 

Diag(C'') = {(zi, Zk) : Zj = Zj for some i ^ j} 

by the formula 



/ N ,• Peizi,...,Zk) . 

p{zi, Zk) = hm (3.1) 
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where Pe{zi, ■■■,Zk) is the probability that each disk {\z — Zj\ < e}, 1 < j < k, 
contains at least one point of Z. The fc-point functions describe correlations within 
fc-point subsets of the point process. Estimates for the fc-point functions are crucial 
for understanding many properties of point processes. The following results taken 
from j35j provide rather complete quantitative information about the behaviour of 
the fc-point functions of random complex zeroes. 

The first result treats the local behaviour of fc-point functions. It appears 
that for a wide class of non-degenerate Gaussian analytic functions, the fc-point 
functions of their zeroes exhibit universal local repulsion when some of the variables 
zi, Zk approach each other. 

Recall that a Gaussian analytic function f(z) in a plane domain G C C is the 
sum 

n 

of analytic functions fn{z) such that 

< oo locally uniformly on G, 

n 

where C„ are independent standard complex Gaussian coefficients. By pf = 
Pf[zi,...,Zk) wc denote the fc-point function of the zero set of the function /. 
It is a symmetric function defined outside the diagonal set Diag(G'"') as in p.ip . 

We skip the technical definition of d-degeneracy, which we use in the assump- 
tions of the next theorem, and only mention that Gaussian Taylor series (either 
infinite, or finite) 

Tl>0 

are d-nondegenerate, provided that cp, ci, Cd-i 7^ 0. In particular, the G.E.F. is 
d-nondcgcncratc for every positive integer d. 

Theorem 3.2 (Local universality of repulsion). Let f be a 2k-nondegenerate Gaus- 
sian analytic function in a domain G, let pf be a k -point function of zeroes of f , and 
let K G G be a compact set. Then there exists a positive constant C ~ C{k, f, K) 
such that, for any configuration of pairwise distinct points Zi, ...,2;^ S K, 

C-^Y[\z,-z,\^ <Pf{zi,...,Zk) < GY[\z,-z,\\ 

The next result is a clustering property of zeroes of G.E.F. 's. It says that 
if the variables in C*^' can be split into two groups located far from each other, 
then the function pk almost equals the product of the corresponding factors. This 
property is another manifestation of almost independence of points of the process 
at large distances. It plays a central role in the proof of the asymptotic normality 
theorem 11.21 for bounded measurable functions. 

For a non-empty subset I ^ {ii, ii} C {1, 2, fc}, we set Zj = {zi-^ , z^^}. 
We denote by 

d{Zj, Zj) = inf \zi — Zj\ 
ieijeJ 
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the distance between the configurations Zi and Zj. 

Theorem 3.3 (Clustering property). For each k > 2, there exist positive constants 
Ck and Afc such that for each configuration Z of size k and each partition of the set 
of indices {1, 2, k} into two non-empty subsets I and J with d{Zi, Zj) > 2Ak, 
one has 

l_e<— 4^^<l + £ with e = Cfce-^('^(^-^-')-^'=)\ (3.2) 
p{Zi)p{Zj) 

Combining Theorems 13.21 and 13.31 and taking into account the translation in- 
variance of the point process Z, we obtain a uniform estimate for pk vahd in the 
whole C*^: 

Theorem 3.4. For each k > 1, there exists a positive constant Ck such that for 
each configuration (zi, Zk), 



z,. - z 



Ck^^^{\z^-z,\)<p{z^....,zk)<Ck'W^{\ 

i<j i<j 

where £{t) = min(t^, 1). 

The proofs of Theorems l3.2l and l3.3l start with the classical Kac-Rice-Hammersley 
formula [3TJ Chapter 3]: 



Pf{zi,...,Zk) = / |77i| ...|?7fc| Vf{ri';zi,...,Zk)dm{rji)...dm{r]k), (3.3) 

where !?/(•; zi, z^) is the density of the joint probability distribution of the 
random variables 

/(zi), /'(zi), ...,/(zfc), f{zk), (3.4) 

and 77' = (0,771, ...,0,77^)^^ is a vector in C^'^. Since the random variables p.4p are 
complex Gaussian, one can rewrite the right-hand side of p.3p in a more explicit 
form 

Pf{zu...,zk)^ / |77ip...|77fcpe-^<^/""''''>d7n(77i)...dm(77fc), (3.5) 

n'^" act 1 / J^k 

where F/ = T f{zi, Zk) is the covariance matrix of the random variables p.4p . 
We consider the linear functionals 

k 

where 



18 



Fedor Nazarov and Mikhail Sodin 



and 7 C if is a smooth contour that bounds a domain G' C K that contains the 
points zi,...,Zk. Observe that for every vector S = (ai, Qfc, in C^*^, we 
have 

{rjS,6)=£\Lf\\ 

This observation allows us to estimate the matrix T^^, and hence the integral on 
the right-hand side of (|3.5|) . using some simple tools from the theory of analytic 
functions of one complex variable. 

We note that using another approach to analyzing the right-hand side of (|3.5p , 
Bleher, ShifFman, and Zelditch proved in [5] that if the points Zi are well separated 
from each other, i.e., 

min \zi — Zj \ > 5 > 0, 

then some estimate similar to p.2p holds with a factor C{k,6) instead of Cfc. 
Unfortunately, in this form the result is difficult to apply. For instance, it does 
not yield the boundedness of the fc-point functions on the whole C'^ , and we could 
not use it for the proof of Theorem 11.21 On the other hand, the result of Bleher, 
Shiffman and Zelditch is valid for a wider class of zero point processes. 



Part II. Random nodal lines 

4. Gaussian spherical harmonic and Gaussian plane 
wave 

We introduce two remarkable Gaussian random functions closely related to each 
other: the Gaussian spherical harmonic on the two-dimensional sphere §^ and its 
scaling limit, the Gaussian plane wave. The study of random plane waves, and 
in particular, of their nodal portraits, originated in applied mathematics and goes 
back to M. S. Longuet-Higgins [27] who computed various statistics of nodal lines 
for Gaussian random waves in connection with the analysis of ocean waves. One of 
the reasons for the recent interest in random plane waves is the heuristic principle 
proposed by M. V. Berry [3] called 'the random wave conjecture'. This principle 
says that the behaviour of high-energy Laplace eigenfunctions in the case when 
the corresponding geodesic flow is ergodic (the so called 'highly excited quantum 
chaotic eigenfunctions') should resemble the behaviour of Gaussian random waves. 
More generally, one would expect that the random spherical harmonic can serve 
as a good model for the typical behaviour of high-energy Laplace eigenfunctions 
on a compact surface endowed with a smooth Ricmannian metric. 

4.1. Spherical harmonics. 

The spherical harmonic of degree n is a real-valued eigenfunction of the Lapla- 
cian (with the minus sign) on the two-dimensional sphere corresponding to the 
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eigenvalue A„ = n(n + 1). Equivalently, it is a trace of a homogeneous harmonic 
polynomial in of degree n on the unit sphere. Let Hn be the 2?! + 1-dimensional 
real Hilbcrt space of spherical harmonics of degree n equipped with the L^(S^)- 
norm. The Gaussian spherical harmonic f is the sum 

71 

fn = ^ CkYk 
k— — n 

where ^fc are independent identically distributed mean zero Gaussian (real) random 
variables with = and {Yk} is an orthonormal basis of Hn, so £||/||^2(g2) = 
1. As a random function, /„ docs not depend on the choice of the basis {Y^} in 
Hn- Since the scalar product in the Hilbert space Tin is invariant under rotations 
of the unit sphere, the distribution of the random spherical harmonic /„ is also 
rotation invariant. The covariance function of the Gaussian spherical harmonic 
equals 

£{fn{x)fniy)} = P„(cOSe(x,?/)) 

where Q{x, y) is the angle between x and y, and P„ is the Legendre polynomial of 
degree n normalized by Pn(l) = 1- 

4.2. Random plane waves. 

Now, we turn to the Gaussian plane wave. Informally speaking, it is the two- 
dimensional Fourier transform of the white noise on the unit circumference S"'^ C K^. 
More formally, we start with the Hilbert space Lgy^(§^) that consists of complex 
valued L^-functions ip on §^ satisfying the symmetry condition 

^(-A)=^, XeS\ 

and consider the Fourier image of this space TL ~ J-L'^y^{§^) with the scalar product 
inherited from igyjj(S^). The space H consists of real-analytic functions 

q>{x) = [ e'^-^(y9(A) dm(A) 
■/si 

(m is the Lebesgue measure on satisfying the Helmholtz equation A$ + $ = 0. 
The Gaussian plane wave is the sum of the random series 

k 

where r]k are standard identically distributed independent (real) Gaussian random 
variables, and is an orthonormal basis in H. The series converges almost 

surely, and its sum is again a real analytic function in satisfying the same 
Helmholtz equation. This construction does not depend on the choice of the basis 
and the distribution of the random function F is invariant with respect to 
translations and rotations of the plane (since the norm in Ji is translation and 
rotation invariant). 
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Applying the Fourier transform to the standard orthonormal basis {A'"}^^^ in 
L^(S^), we get the functions i'"Jm(r)e'™^ where {r,9) are polar coordinates, and 
Jm is the Bessel function of order m. This yields a more explicit formula for the 
Gaussian plane wave: 

Fix) = Rc ^ CmJ|rr^|(r)e'"^ X = (r,0), 

where Cm are independent identically distributed complex Gaussian random vari- 
ables with £|CmP = 2. 

The covariance function of F (which is the same as the reproducing kernel of 
the space H) is given by the Bessel kernel: 

£{F{x)F{y)} ^M\x-y\). 



It is worth mentioning that there are other constructions of random plane 
'monochromatic' waves as random linear combinations ('superpositions') of ele- 
mentary plane waves e\{x) = e'"^'^. For instance, following Oravecz, Rudnick, 
Wigman [40] and Rudnick, Wigman [43], one can consider 'arithmetic random 
waves' 

/i^(a;)=Re^C.e2"'('"") 

V 

where Ci/ are independent identically distributed complex Gaussian random vari- 
ables with i^lCmP = 2, and the sum is taken over v <^Tl? with \v'^ — N. This model 
remarkably combines analysis and probability theory with the number theory. Its 
covariance function 

£{hN{x)hN{y)} = ^cos27r(!/ • [x - y)) 

has a more erratic behaviour than the covariance functions of the Gaussian spher- 
ical harmonic and the Gaussian plane wave. 



4.3. Random plane waves as scaling limits of random spher- 
ical harmonics. 

The Gaussian plane wave is a scaling limit of the Gaussian spherical harmonic 
/„ when n oo. This is a very special case of a result of Zelditch [55j pertaining 
to a wide class of Riemannian smooth surfaces, in particular, to all real-analytic 
Ricmannian surfaces. 

Informally, for any fixed R, the restrictions of the Gaussian functions /„ on 
spherical disks of radius R/n converge as random processes to the restriction of 
F on the euclidean disk of radius R. More formally, we fix a point xq G S^, and 
define the random Gaussian function Fn on the tangent plane T^gS'^ by 



F,,{u)^{UoexpJ{^), 



(4.1) 
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where exp^^ : T^^Ei'^ — > §^ is the exponential map. After this scaling, the covariance 
equals 

£{F^{u)F,,iv)} - F„ (cose (exp,,, (^) ,exp,„ (ii))) 

When n goes to oo, the angle between the points exp^^ (^), and exp^^ (;^) on 
the sphere is equivalent to \u — v\/n (locally uniformly in u and v). Then by 
classical Hilb's asymptotics of the Legendre polynomials [501 Theorem 8.21.6], the 
scaled covariance function £{Fn{u)Fn{v)} converges to the Bessel kernel Jo{\u — v\) 
locally uniformly in u and v. 



5. Nodal portrait 

In most cases, the basic questions about the asymptotic behaviour of the nodal 
portrait of the Gaussian spherical harmonic /„ as n — )• oo. and their counterparts 
for the Gaussian plane wave in the 'large area limit' are equivalent to each other. In 
what follows, we concentrate on spherical harmonic versions which are somewhat 
easier to formulate. 

For the spherical harmonic g E Tin, we denote by Z{g) = {a; G : g{x) ~ 0} its 
nodal set. The connected components of the complement §'^\Z(g) are called nodal 
domains of g. The following (deterministic) facts are special cases of well-known 
results valid for Laplace eigenfunctions on smooth Riemannian surfaces: 

Theorem 5.1. There is a positive numerical constants C such that for each g G 
Hn, the nodal set Z{g) is a Cn^^-net on 

Theorem 5.2. There is a positive numerical constant c > such that for each 
g G Hn, every nodal domain of g contains a disk of radius cn^^. 

Together with Figure 3, this gives a very rough idea of how the nodal portraits 
of a spherical harmonic of large degree should look. 

One can find more information about the geometry and the topology of the 
nodal portraits of spherical harmonics (and more generally, of high-energy Laplace 
eigenfunctions on smooth Riemannian surfaces) in the pioneering works of Donnelly 
and Fefferman [121 [131 HI]: as well as in the more recent works of Eremenko, 
Jackobson, and Nadirashvili [TB], Mangoubi and Nazarov, Polterovich and 
Sodin |31j . Still, our understanding of nodal portraits is rather restricted, and, in 
our opinion, this classical area of analysis is very much underdeveloped. 

5.1. Length of the nodal set. 

The basic characteristics of the nodal set of a spherical harmonic g are its length 
L{g) and the number N{g) of connected components (which is one less than the 
number of nodal domains). Useful classical integral formulas for the length due 
to Poincarc and to Kac and Rice make the length a somewhat easier object for a 
study. For instance, one can prove 
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Figure 3. Nodal portrait of the Gaussian spherical harmonic of degree 40 (figure by 
A. Barnett) 

Theorem 5.3. There exists a positive numerical constant C such that for each 
g € Hn, C-^n < L{g) < Cn 

This is a special case of a more general result valid for Laplace eigenfunctions 
corresponding to large eigenvalues (with n replaced by -s/A). The lower bound is 
valid for any smooth Riemannian surface (this is a result of Briining |10|). while the 
upper bound was proven by Donnelly and Fefferman |12] for real-analytic surfaces. 
In the smooth category, it was conjectured by S. T. Yau, and still remains open 
in spite of many efforts. Note that one can easily deduce the upper bound in 
Theorem [531 from the fact that spherical harmonics are restrictions of polynomials 
(that is, without using the deep result of Donnelly and Fefferman). 

For the Gaussian spherical harmonic, Bcrard showed in [2] that 

Theorem 5.4. £i(/„) = ttV2X^, = \/2Trn + 0(1). 

The question about the variance is more delicate. Recently, Wigman |53| con- 
firmed a guess made by M. V. Berry [i] in a slightly different context: 

Theorem 5.5. For n — > oo, 

65 

variance of L(/„) = — logn -I- 0(1) . 

The proof of this theorem is based on a very careful analysis of asymptotic can- 
celations that appear in the Kac-Rice integral representation of the variance of 

L{fn). 

5.2. The number of connected components. 

There arc few classical facts about the number of components N{g). The celebrated 
Courant nodal domain theorem yields 
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Theorem 5.6. For every g <E T-Ln, N{g) < v? . 

For large n, this upper bound was improved by Pleijel [H] to 0.69n^. Apparently, 
the sharp asymptotic upper bound is not known yet. Simple examples show that 
it cannot be less than (i + o(l))n^. H. Lcwy US] gave an elegant construction of 
spherical harmonics of any degree n whose nodal sets have one component for odd 
n and two components for even n, which proves that no non-trivial lower bound 
for N{g) is possible. 

Till recently, nothing had been known about the asymptotic properties of the 
random variable N{f„) when the degree n is large. The principal difficulty is its 
non-locality: observing the nodal curves only locally, one cannot make any definite 
conclusion about the number of connected components. Several years ago Blum, 
Gnutzmann, and Smilansky [B] raised a question about the distribution of the 
number of nodal domains of high-energy Laplace eigenfunctions. In the ergodic 
case, in accordance with Berry's heuristic principle, they suggested to find this 
distribution for Gaussian random plane waves and performed the corresponding 
numerics. To compute this distribution, Bogomolny and Schmit proposed in |8] an 
elegant percolation-like lattice model for description of nodal domains of random 
Gaussian plane waves. This model completely ignores the (quite big) correlations 
between the values of the random function /„ at different points but nevertheless 
agrees with numerics pretty well. This agreement is probably due to some hid- 
den 'universality law' rather then the possibility to directly reduce one model to 
another. 

5.3. Bogomolny-Schmit percolation-like model. 

The Bogomolny-Schmit hypothesis is that the distribution of nodal domains N(f„) 
is roughly the same as in the following critical percolation model. Consider the 
square lattice with the total number of sites equal to (£L(/„)) , that is propor- 
tional to n^, and change at each site the line crossing to one of the two equiprobable 
avoided crossing, as shown in the following figure. At different sites, the changes 



Figure 4. Avoided nodal crossings in the Bogomolny-Schmit model 
are independent. 

Then Bogomolny and Schmit introduce two dual square lattices: the 'blue one' 
with vertices at the cells of the grid where the function is positive, and the 'red one' 
with vertices at the cells of the grid where the function is negative. Each realization 
of the random process generates two graphs, the blue one whose vertices are the 
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blue lattice points and the red one whose vertices are the red lattice points. Two 
vertices are connected by an edge if the corresponding cells of the grid belong to 
the same nodal domain of the random function. Each of these graphs uniquely 




Figure 5. Bond percolation on two dual lattices 



determines the whole picture, so it sufhces to consider only one of them, and 
each of them represents the critical bond percolation on the corresponding square 
lattice. Then using some heuristics coming from statistical mechanics, Bogomolny 
and Schmit predicted that for n — > oo, 

£N{U)^{a + o(l))n', 

and 

variance of N{fn) = (fe + o(l))n^, 

with explicitly computed positive numerical constants a and b. They also argued 
that the fluctuations of the random variable N{fn) are asymptotically Gaussian 
when n — > (X), and concluded their work with a remarkable prediction of the power 
distribution law for the areas of nodal domains, based on the percolation theory. 

It would be interesting to test numerically whether the Bogomolny-Schmit 
model persists for random linear combinations of plane waves e''^'^ with differ- 
ent wave numbers k. 

5.4. Rigorous results. 

Recently, we showed in [32j that, in accordance with one of the Bogomolny and 
Schmit predictions, £N{f)/n^ tends to a positive limit when n — ^ oo, though our 
proof docs not provide us with an explicit value of the limit a, so we cannot juxta- 
pose it with the one predicted by Bogomolny and Schmit. In addition, we proved 
that the random variable N{f)/n? concentrates around this limit exponentially. 
Since for any spherical harmonic g E H, the total length of its nodal set Z{g) does 
not exceed Const n, our result yields that, for a typical spherical harmonic, most 
of its nodal domains have diameters comparable to 1/??. 

Theorem 5.7 (Number of nodal domains). There exists a constant a > such 
that, for every £ > 0, we have 



N{fn) 



> e| < C(£)e 

where c{e) and C(e) are some positive constants depending on e only. 
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The exponential decay in n in Theorem 15.71 cannot be improved: we showed 
that given a positive and arbitrarily small k, 

7'{iV(/„)<«;n2} >e-^(«)". 

On the other hand, our proof of Theorem 15.71 gives a very small value c(e) ~ e"'^^ 
and it would be nice to reduce the power 15 of e to something more reasonable. 
The question about the variance of N(fn) remains open. 

The last but not least remark is that the proof of Theorem 15.71 uses only rel- 
atively simple tools from the classical analysis, which we believe may work in a 
more general setting of random functions of several real variables (and for higher 
Betti numbers), while it seems that the Bogomolny-Schmit model is essentially a 
two-dimensional one. 

5.5. Related work. 

We are aware of several encouraging attempts to tackle similar questions in 
different contexts. In |49j (motivated by some engineering problems), Swerling 
estimated from below and from above the mean number of connected components 
of the level lines Z(t, /) = {/ = t} of a random Gaussian trigonometric polynomial 
/ of two variables of a given degree n. His method is based on estimates of 
the integral curvature of the level line Z{t,f). The estimates are rather good 
when the level t is separated from zero, but as t — they are getting worse and, 
unfortunately, give nothing when t ~ 0. 

In the paper [28], Malevich considered C^-smooth Gaussian random functions 
/ on with positive covariance function that decays polynomially as the distance 
between the points tends to infinity. She proved that for T > Tq, 

C^^T^ < 8N{T) < CT^ , 

where N{T) is the number of the connected components of the zero set of / that 
are contained in the square [0, T] x [0, T], and C is a positive numerical constant. 
Her proof relies heavily on the positivity property of the covariance function that 
docs not hold for Gaussian spherical harmonics or for Gaussian trigonometric poly- 
nomials. 

In the recent paper [30] , Mischaikow and Wanner studied the following question. 
Suppose / is a random smooth function on the square [0, 1]^ with periodic boundary 
conditions and that the signs of / are computed at the vertices of the grid with 
mesh S. How small must 6 be (in terms of the a priori smoothness constants of 
f) in order to recover the Betti numbers of the sets {/ > 0} and {/ < 0} with 
probability close to one ? In particular, they show that for random trigonometric 
polynomials of two variables of degree N, it suffices to take 6 = cN^'^ where c is 
a sufficiently small positive numerical constant. It is possible that their bounds 
can be significantly improved if instead of recovering the exact values of the Betti 
numbers one tries to recover them with a small relative error. 
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6. The sketch of the proof of the theorem on the 
number of nodal domains 

Here, we will describe the main ideas behind the proof of Theorem 15.71 All the 
details can be found in |32j . 

6.1. The lower bound £N{fr,) > cn^. 

This is the simplest part of the story. Denote by the spherical distance. 

Given a point a; £ and a large positive constant C, we consider the event 

= {fn{x) > C, and /„(?/) < -C for all y satisfying d{x, = 

where p is a constant whose value will be specified below. Clearly, if the event fix 
occurs, then the disk of radius p/n centered at x contains a closed nodal line of 
fn- We claim that 

vin^) > c> 0, 

where c is a positive constant. The reason is rather straightforward: for every 
point X G there exists a function bx G Hn with \\bx\\ = 1 such that 

bx{x) > CQ\/n and bx{y) < — co-\/n whenever d{x, y) ~ — . 

n 

One can take as bx the zonal spherical harmonic with "pole" x. Then we can 
represent /„ in the form 

fn — £,obx + fx 

where is a Gaussian random variable with — 2n+i ' ^'^"^ f^ ^ Gaus- 
sian spherical harmonic with = 2n+i independent of ^nd check that 
with positive probability, the 'perturbation' fx cannot destroy a short nodal curve 
around point x provided by the function bx- 

It remains to pack the sphere by ~ -n? disjoint disks of radius 2p/n. With a 
positive probability, each of these disks contains a closed nodal line of /„. Whence, 
the lower bound for £N[fn)- 

6.2. Levy's concentration of measure principle. 

To establish the exponential concentration of the random variable N{fn) around its 
median, we would like to use a version of classical Levy's concentration of measure 
principle. 

Given a set K, we denote by i^+p the ^-neighbourhood of K. We apply this 
notation to subsets of Hn and the L^-distance, to subsets of and the usual 
spherical distance, and also to subsets of K'' with the Euclidean distance. The 
following Gaussian isoperimetric theorem is due to Sudakov and Tsirelson j45] and 
Borell [7]: 
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Theorem 6.1. Let be the standard Gaussian measure on R*^. Let E C Mf^ be a 

Borel set, and 11 be an ajfine half-space such that 

7d(i]) = 7d(n) . 

Then for each t > Q, 

7d(S+p) > 7d(n+p) . 

A simple computation shows that if 7d(n+p) is not too close to 1, then 7(i(n) 
must be exponentially small in d, like exp[— cp^d]. Applying this to the 2n + 1- 
dimcnsional space "Hn of spherical harmonics of degree n, we get 

Corollary 6.1 (Concentration of Gaussian measure on T-Ln)- Let G C Hn be 

any measurable set of spherical harmonics. Suppose that the set G+p satisfies 
V{G+p) < |. Then V{G) < 26"'='''". 

To use the concentration of measure principle, we need to show that the number 
N{f) doesn't change too much under slight perturbations of / in the L^(§^)-norm. 
Certainly, this is not true for all / e Hn, but we will show that the "unstable" 
spherical harmonics / € H„ for which small perturbations can lead to a drastic 
decrease in the number of nodal lines are exponentially rare. Here is a key lemma 
which is probably the most novel part of the whole story: 

Lemma 6.1 (Uniform lower semi-continuity of N{fn)/n?)- For every e > 0, there 
exist p > and an exceptional set E C Hn of probability V{E) < C(e)e~'^^'^^" such 
that for all f G Hn \ LJ and for all g G Hn satisfying \\g\\ < p, we have 

N{f + g) > N{f) - en^ . 

The uniform lower semi-continuity lemma readily yields the exponential con- 
centration of the random variable N{fn)/n? near its median a„. First, consider 
the set 

G={/eH„:iV(/)>(a„ + e)n2}. 

Then for f {G\ E)+p, we have N{f) > a^n^ , and therefore, V{{G \ E)+p) < \. 
Hence, by the concentration of Gaussian measure, V{G\E) < 2e-''P and finally, 

V{G) < V{G \E)+ V{E) < 2e-'p'" + C(£)e-"(-')" < C(£)e-'=(-')" . 
Now, we turn to the set 

F={feHn:Nf< (a„-e)n2}. 

Then 

F+pC {feUn-. Nf<ann^}uE, 

so that 

7'(F+p)<i + C(e)e-^(-)"<^ 
for large n, and it follows that ViF) < 26"'='''". 
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6.3. The uniform lower continuity of the functional f>-^N(f) 
outside of an exceptional set. 

Here we explain how we prove Lemma l6. II 

6.3.1. Exceptional spherical harmonics E with unstable nodal portraits. 

Instability of the nodal portrait of a spherical harmonic / £ Hn under small 
perturbations is caused by points where / and V/ are simultaneously small. Let 
a and S be small positive parameters, and let i? be a large positive parameter (all 
of them will depend on e from Lemma [Q]) . Cover the sphere S'^ by ~ R~^n? disks 
Dj of radius R/n in such a way that the concentric disks ADj with 4 times larger 
radius cover the sphere with a bounded multiplicity. We call the disk Dj stable 
if for each x G 3-Dj either > a or |V/(a:)| > an. Otherwise, the disk Dj 

is unstable. We call the spherical harmonic / € T-Ln exceptional if the number of 
unstable disks is at least Sn? , and denote by E the set of all exceptional spherical 
harmonics of degree n. 

Lemma 6.2. Given 5 > Q, there exist positive C{5) and c{5) such that 

V{E) < C(J)e""(*)" 
provided that the constant a is sufficiently small. 

Curiously, the proof of this lemma uses the concentration of measure principle 
again. It also uses the fact that given x S S^, the Gaussian random variable f{x) 
and the Gaussian random vector V/(a;) are independent. 

6.3.2. Identification of unstable connected components. It remains to 
show that at most ern^ nodal components of a stable spherical harmonic can disap- 
pear after perturbation of / by another spherical harmonic g € T-Ln with sufficiently 
small L^-norm. First, in several steps, we identify possibly 'unstable' connected 
components of the nodal set Z{f) that can disappear after perturbation, show that 
their number is small compared to n^, and discard them. Then we verify that all 
other connected components of Z{f) do not disappear after the perturbation. 

First , we discard the nodal components F whose diameter is bigger than R/n. By 
the upper bound in the length estimate in Theorem l5.3[ their number is < CR~^n^ 
which is small compared to n"^. 

With each remaining component F of the nodal set Z{f) we associate a disk 
Dj such that DjOT ^ 0. Then F C 2Dj. Since each nodal domain contains a disk 
of radius c/n fThcorem 15. 2p . the number of components F intersecting Dj (and, 
thereby, contained in 2Dj) is bounded. 

Second , we discard the components F with unstable disks Dj. Since / is not ex- 
ceptional, and since each disk Dj cannot intersect too many components contained 
in 2Dj, the number of such components is also small compared to n^. 

At last , we discard the components F such that 

max I q I > a . 
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To estimate the number A'' of such disks, we denote by D* C 4Dj the disk of radius 
1/n centered at the point yj where \g\ attains its maximum in 3Dj. By standard 
cUiptic estimates, 

/ \gf>n-%{y,)\^a'n-\ 

JD* 

whence 

> II5||l2(s2) > Na^n~'^ , 

that is, N < p^a^^v?. As above, we conclude that the number of components F 
affected by this is < R^N < p^a^^n^ which is much less than en^ provided that 
is much less than ea^R^^ . 

6.3.3. Verification of stability of the remaining connected components. 

Now, we claim that the remaining components T cannot be affected by the per- 
turbation of / by g. To see this, we consider the connected component A-c{t) of 
the set {|/| < t} that contains F, and look what may happen with this component 
when t grows from to a. 

• As long as ^r(i) stays away from the boundary d{3Dj), it cannot merge 
with another component of {|/| < t} because such a merge can occur only 
at a critical point of / and there are none of them in Ar{t) D iDj. 

• For the same reason, neither of the two boundary curves of (t) can collapse 
and disappear. 

• At last, Ar{t) cannot reach d{3Dj) before it merges with some other com- 
ponent: indeed, if a; € Ar(t) and Ar(t) lies at a positive distance from the 
boundary d{3Dj) then we can go from x in the direction of V/ if f{x) < 
and in the direction of —V/ if f{x) > 0. In any case, since |V/| > an in 
Ar{t), we shall reach the zero set Z{f) after going the length 1/n or less. 
Since F is the only component of Z(f) in Ar{t) before any merges, we con- 
clude that Ar{t) c F+i/„. Recalling that dist (F, 9(3£)j) ) > R/n, we see 
that, for each t < a, Ar{t) stays away from the boundary d{3Dj). 

Thus, each component F lies in a topological annulus Ar = Ar{ct) which is 
contained with its boundary in the open disk 3Dj and such that / — +a in one 
boundary curve of Ar and / = —a on the other. Recalling that \g\ < a in 3Z)j, 
we conclude that Z{f + g) has at least one connected component in Ay- 

6.4. Existence of the limit of £N{fr,)/n^ . 

We already know that £N{fn) > n'^ and that N{fn)/n'^ concentrates near its 
median exponentially. Thus, to finish the proof of Theorem l5.7[ it remains to show 
that the sequence {£N{f„)/n'^} converges. We deduce this from the fact that the 
Gaussian spherical harmonic has a scaling limit combined with rotation invariance 
of the distribution of /„. Since this part does not require any new ideas beyond 
the ones we've already introduced, we just refer the reader to |32] for the details. 
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6.5. Comments and questions. 

Making use of a non-critical version of their percolation model, Bogomolny and 
Schmit obtained in [9] a series of predictions for the behaviour of the components 
of level sets which agree with numerics. In a stark contrast, we do not have a 
rigorous answer even to the following most basic question: 

Question 6.1. Prove that for each e>0 and each rj > 0, the probability that the 
level set |.t G §^ : fn{x) >e} has a component of diameter larger than rj tends to 
zero as n ^ oo. 

One of the reasons for our ignorance is the aforementioned non-locality of the 
number of connected components. Another essential difficulty is a very slow decay 
of the correlations which docs not allow us to think of restrictions of our process 
to a collection of wcll-scparatcd disks as of almost independent processes. 

Question 6.2. Estimate the mean number of large components of the nodal set 
whose diameter is much bigger than 1/n. For instance, of those whose diameter is 
comparable to n~'^ with < a < 1. 

Nothing is known about the number of connected components of the nodal 
set for 'randomly chosen' high-energy Laplace eigenfunction f\ on an arbitrary 
compact surface M without boundary endowed with a smooth Riemannian metric 
g. It is tempting to expect that Theorem 15 . 71 models what is happening when M is 
the two-dimensional sphere §^ endowed with a generic Riemannian metric g that 
is sufficiently close (with several derivatives) to the constant one. 

Instead of perturbing the 'round metric' on the sphere S'^, one can add a small 
potential V to the Laplacian on the sphere. The question remains just as hard. 
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